Molecular and Structural Parallels between Gluten Pathogenic Peptides and Bacterial-Derived Proteins by Bioinformatics Analysis

Vazquez, Diego S.; Schilbert, Hanna M.; Dodero, Veronica I.

doi:10.3390/ijms22179278

Open AccessArticle

Molecular and Structural Parallels between Gluten Pathogenic Peptides and Bacterial-Derived Proteins by Bioinformatics Analysis

by

Diego S. Vazquez

^1,2,†,

Hanna M. Schilbert

^3,†,‡

and

Veronica I. Dodero

^3,*

¹

Grupo de Biología Estructural y Biotecnología (GBEyB-IMBICE), Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Roque Sáenz Peña 352, Bernal B1876BXD, Buenos Aires, Argentina

²

Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Av. Rivadavia 1917, Ciudad Autónoma C1033AAJ, Buenos Aires, Argentina

³

Department of Chemistry, Organic Chemistry OCIII, Universität Bielefeld, Universitätsstraße 25, 33615 Bielefeld, Germany

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

^‡

Current address: Genetics and Genomics of Plants, Center for Biotechnology (CeBiTec) and Faculty of Biology, Universität Bielefeld, Universitätsstraße 25, 33615 Bielefeld, Germany.

Int. J. Mol. Sci. 2021, 22(17), 9278; https://doi.org/10.3390/ijms22179278

Submission received: 12 August 2021 / Revised: 23 August 2021 / Accepted: 25 August 2021 / Published: 27 August 2021

(This article belongs to the Special Issue Human Gut Microbiome and Diet in Health and Disease)

Download

Browse Figures

Versions Notes

Abstract

:

Gluten-related disorders (GRDs) are a group of diseases that involve the activation of the immune system triggered by the ingestion of gluten, with a worldwide prevalence of 5%. Among them, Celiac disease (CeD) is a T-cell-mediated autoimmune disease causing a plethora of symptoms from diarrhea and malabsorption to lymphoma. Even though GRDs have been intensively studied, the environmental triggers promoting the diverse reactions to gluten proteins in susceptible individuals remain elusive. It has been proposed that pathogens could act as disease-causing environmental triggers of CeD by molecular mimicry mechanisms. Additionally, it could also be possible that unrecognized molecular, structural, and physical parallels between gluten and pathogens have a relevant role. Herein, we report sequence, structural and physical similarities of the two most relevant gluten peptides, the 33-mer and p31-43 gliadin peptides, with bacterial pathogens using bioinformatics going beyond the molecular mimicry hypothesis. First, a stringent BLASTp search using the two gliadin peptides identified high sequence similarity regions within pathogen-derived proteins, e.g., extracellular proteins from Streptococcus pneumoniae and Granulicatella sp. Second, molecular dynamics calculations of an updated α-2-gliadin model revealed close spatial localization and solvent-exposure of the 33-mer and p31-43 peptide, which was compared with the pathogen-related proteins by homology models and localization predictors. We found putative functions of the identified pathogen-derived sequence by identifying T-cell epitopes and SH3/WW-binding domains. Finally, shape and size parallels between the pathogens and the superstructures of gliadin peptides gave rise to novel hypotheses about activation of innate immunity and dysbiosis. Based on our structural findings and the similarities with the bacterial pathogens, evidence emerges that these pathologically relevant gluten-derived peptides could behave as non-replicating pathogens opening new research questions in the interface of innate immunity, microbiome, and food research.

Keywords:

celiac disease; non-celiac gluten sensitivity; SH3 and WW domains; 33-mer peptide; p31-43 peptide; gliadin epitopes; pathogens; sequence similarity; innate immune response

1. Introduction

Celiac disease (CeD) is a chronic, small-intestinal T-cell-mediated autoimmune disease triggered by the ingestion of dietary gluten from common food grains such as wheat, rye, and barley, in genetically predisposed individuals with a prevalence of about 1% in the general population with regional differences [1]. Other gluten-related disorders (GRDs) such as non-celiac gluten sensitivity (NCGS) are less understood and have a prevalence rate between 0.5% and 5% [2]. Nowadays, the only proven treatment for GRDs is strict and life-long adherence to a gluten-free diet [3].

Viral and bacterial pathogens have long been suspected of triggering immune responses that are directed toward autoimmunity in CeD. In 2002, the group of Khosla showed that the immunodominant gluten fragment has sequence similarity with pertactin, a highly immunogenic protein from the bacterium Bordetella pertussis; however, this result was not further investigated [4]. It recently identified and characterized a number of mimics of HLA-DQ2.5-restricted gliadin determinants derived from the commensal bacterium Pseudomonas fluorescens, activating disease-relevant gliadin reactive T cells isolated from CeD patients [5]. This report was a major proof of concept that a molecular mimicry mechanism may trigger CeD.

Beyond T-cell activation in CeD, it was also proposed that gluten proteins have functional similarities with non-replicative pathogens such as prions [6,7]. The main issue with gluten proteins is that the human digestive proteases can only partially degrade them leading to a mixture of peptides that elicit immune and toxic effects in predisposed individuals and cell lines [8]. The mononuclear phagocytic system (MPS), composed mainly of macrophages, neutrophils, and dendritic cells, is part of the first line of defense against pathogens. Pathogens are recognized by various immune cells, such as macrophages and dendritic cells, via pathogen-associated molecular patterns (PAMPs) on the pathogen surface, which interact with complementary pattern-recognition receptors (PRRs) on the pathogen surface the immune cell surfaces. While PRR activation is a central component to the resulting immune response, innate cells also respond to the size and shape of the pathogens and the spatial organization of PAMPs on the bacterial surface. In recent years, it has become evident that the MPS can engulf other non-self-systems such as synthetic nanoparticles generating in many cases an immune response against the foreign nanosystem [9,10]. From a pathological standpoint, there are some concerns in the possibility that the presence of nanostructures could exacerbate or prolong PRR-driven inflammatory reactions, leading to uncontrolled tissue-damaging inflammation. In this context, it was demonstrated that pepsin digests of gliadin form spontaneously amyloid-like structures that trigger genes in the gut epithelial cell model Caco-2 involved in recruiting specialized immune cells [11]. Furthermore, the most studied gluten peptides are 33-mer peptides and the p31-43 fragment that form peptide nanostructures, too. Both peptides trigger the adaptive or an innate immune response, respectively, in CeD patients, animal models, and cell lines culture [12,13,14,15,16].

The immunodominant 33-mer peptide comprises residues 57 to 89 of α-2-gliadin (LQLQPFPQPQLPYPQPQLPYPQPQLPYPQPQPF). In total, 39% of the 33-mer peptide residues are prolines leading to a type-II polyproline (PPII) conformation in solution, which is known to be bound to MHC class-II molecules [14]. When the 33-mer accumulates, it forms superstructures ranging from dimers to nano- and microstructures ranging from 10 nm to more than 1 µm [17,18,19]. Importantly, there is a concentration-dependent structural transition from PPII toward a parallel β-sheet conformation accompanied by the formation of large superstructures that activate macrophages in vitro via Toll-like receptor 4 (TLR4) and TLR2 [20,21]. The mentioned secondary structures are essential in protein-protein interaction and might function as a signaling component. The high percentage of Q makes this peptide amphiphilic favoring the formation of hydrogen bonds [22], a key factor in the self-assembling process, stressing the importance between sequence and morphology [17].

The second most studied gliadin peptide is the toxic p-31-43, a 13-mer peptide comprising the amino acids 31 to 43 (FPGQQQPFPPQQP). Although the p31-43 peptide is not presented by the HLA-DQ2 [23], the reason for its toxicity in CeD remains unknown. The mechanism by which p31-43 induces an immune response in celiac patients has been recently attributed to its effects on the endocytic compartment affecting several cell functions such as proliferation, cell motility, and innate immunity activation [23]. Nanayakkara et al. [24] recently proposed that p31-43 induces the IFN-α-mediated innate immune response in the CaCo-2 enterocyte cell-line by activating the TLR7 signaling pathway mimicking the immune response triggered by viruses. Recently, it was reported that p31-43 has a PPII secondary structure and can, as well as 33-mer, self-assemble under physiologically relevant conditions [25]. Even more, p31-43 oligomers have been proposed to be responsible for activating the inflammasome in murine models [26].

Finally, it is reported that gliadin acts as a modulator of human microbiota [27], and it is not clear if changes in the microbiota lead to gluten-related disorders or that the presence of gluten peptides in the gut leads to changes in the microbiota [28,29].

Based on the reported experimental evidence in cell lines, animal models, and patients’ biopsies, it may also be possible that gluten fragments share additional structural and morphological similarities with bacterial or viral pathogens beyond the established T-cell cross-reactivity in CeD, which would be particularly relevant in the understanding of the innate immune activation.

Herein, we showed using bioinformatics that the two most relevant gliadin peptides 33-mer and p31-43 have sequence and structural similarities with proteins found in bacterial pathogens giving novel insights into the early innate immune response. First, we performed a stringent Basic Local Alignment Search Tool protein (BLASTp) search [30] based on the 33-mer and p31-43 sequences outside the Gramineae family and identified high sequence similarity with proteins from different bacterial pathogens. Among them are Streptococcus pneumoniae and Granulicatella sp., two well-known causative agents of diverse human diseases. Second, we investigated the spatial localization of both gluten fragments in α-2-gliadin using molecular modeling and calculated sequence-based intrinsic disorder (ID) tendency and subsequent calculations of solvent-accessible surface area (SASA) and aggregation propensity. Next, we employed tridimensional homology modeling to analyze the structural similarities between the target pathogen-derived proteins and the 33-mer and p31-43 sequences in α-2-gliadin, performing a comprehensive literature search to identify pathogenic similarities. Importantly, in those pathogenic proteins, with sequence similarity with the 33-mer fragment, putative T-cell epitopes, and SH3-binding domains were found. Finally, the shape and size similarities between the superstructures of the gliadin peptides and the reported pathogens were presented and connected with morphology mimicry and dysbiosis (Figure 1).

2. Results and Discussion

2.1. High Sequence and Structural Similarity of 33-mer and p31-43 Sequences in Pathogen-Derived Proteins

One key unsolved question in GRDs is the environmental triggers that promote the diverse reactions to gluten proteins observed in susceptible individuals. Thus the factors involved in the switch from tolerance to disease need to be investigated. Initially, we performed an in-depth sequence and structural analysis of foreign proteins sharing high sequence similarity regions with the 33-mer and p31-43 sequences. The top-five BLASTp hits using the 33-mer and the p31-43 sequence as queries are presented in Table 1. Overall the identified similarity regions reached up to 68% sequence identity in the 33-mer and up to 85% for the p31-43 similarity regions. Importantly, 5 out of 10 identified proteins belonging to the host organisms known to be pathogenic for humans to different degrees. The complete information obtained in this work is summarized in Table 1 and discussed in the following sections.

2.2. The Structural Evaluation of Gliadin Peptides Harboring Pathogen-Related Proteins Similarity Regions and Their Function

2.2.1. Spatial Localization and Structural Information of the Relevant Gliadin-Derived Peptides in the α-2-Gliadin 3D Model

Wheat gluten proteins can be classified into soluble gliadins and insoluble low and high-molecular-weight glutenins depending on their solubility in aqueous/alcohol mixtures [37]. Moreover, gliadins can be classified into α/β-, γ- and ω-gliadins according to their primary sequence and electrophoretic mobility [37]. Gliadins are named prolamins because of their high content of proline (Pro) and glutamine (Gln) amino acids, having self-assembly capabilities [20,38]. More than 50 immunogenic epitopes are present in different classes of wheat gluten proteins [13]. Considering the absence of structural information about gliadins because of their intrinsic disorder (ID) (Figure 2), we previously built a tridimensional model of α-2-gliadin [19]. Here, we present an improved model (Figure 2B) that incorporates the disulfide bridges between Cys144-Cys177, Cys175-Cys269, and Cys187-Cys277 [39]. Interestingly, the presence of the disulfide bonds leads to an increase in conformational disorder (Figure 2B). However, despite the notorious topological change, the structure-based aggregation profile remains nearly unchanged (Figure 2D,E). Interestingly, both proteolytic-resistant peptides are predicted to be exposed to the solvent and are located in disordered regions that are spatially closed in both tridimensional models (Figure 2B,C).

2.2.2. The Homology Models of Pathogen-Related Proteins Containing Sequences Similar to 33-mer Gliadin Peptide and Their Function

The PQQ-repeat protein of S. viridochromogenes harbors one transmembrane domain of pyrroloquinoline quinone (PQQ) repeat sequence (Table 1 (A): Hit#1). This suggests that it belongs to the quinoprotein alcohol dehydrogenase-like protein superfamily, which has a β-propeller-like fold (Figure 3A). This architecture is composed of four-stranded antiparallel and twisted β-sheets, which are radially distributed, forming a central tunnel similar to a TIM-barrel fold (Figure 3A). The first ~150 amino acids could not be partially modeled due to the high sequence disorder (Figure 3A). In general, the PQQ serves as a redox cofactor for several enzymes, e.g., bacterial dehydrogenases [40]. A study showed that dietary PQQ exposure resulted in apparent antioxidant potential changes and significant decreases in the levels of plasma C-reactive protein, IL-6, urinary methylated amines such as trimethylamine N-oxide, and changes in urinary metabolites consistent with enhanced mitochondrial-related functions [41].

The aspartate kinase from Fischerella (Table 1 (A): Hit#2; Figure 3B) contains a putative nucleotide and Mg²⁺ ion-binding site, as well as two putative allosteric regulatory sites. Aspartate kinases catalyze the ATP-dependent reaction of L-aspartate to 4-phospho-L-aspartate and ADP and are involved in amino acid metabolism [32].

The 2036 residues-long hypothetical protein of unknown function from Granulicatella sp. contains a F/YSIRK-type signal peptide (Table 1 (A): Hit#3) with the sequence FSIRKxxxGxxS [42], which suggests a transmembrane destination in line with our results presented below. Due to the lack of structural data of related proteins, the intrinsic disorder profile (Figure S1), and the large size of the protein, we could not model this protein to obtain 3D structural information. However, the C-terminal LPxTG motif strongly suggests that this protein can be processed by sortases [43]. Sortases are cysteine transpeptidases, which decorate the protein surfaces and are extensively found in Gram-positive bacteria [43]. Proteins modified by sortases are often involved in infection, virulence, and colonization processes [44,45].

Choline-binding proteins (cbps) such as cbp A from S. pneumoniae (Table 1 (A): Hit#4, Figure 4B) are non-covalently bound cell-surface proteins involved in virulence [46,47]. Cbp A increases the expression of intercellular adhesion molecule 1 (ICAM-1, also known as CD54), an early inflammatory marker [47]. Therefore, cbp A induces the transcription and release of proinflammatory molecules by human alveolar epithelial cells [47]. ICAM-1 is necessary for the initial contact of circulating leukocytes with activated endothelium and is up-regulated or constitutively expressed in response to proinflammatory cytokines on endothelial cells, activated lymphocytes, monocytes, and epithelial cells [48]. Further, the histological evaluation revealed increased expression of ICAM-1 in most cells of the lamina propria in jejunal biopsies from CeD patients.

Additionally, it was observed that ingestion of gluten induced a rapid increase in ICAM-1 expression in CeD patients compared to healthy control and patients with non-celiac gluten sensitivity [49]. Importantly, a specific variant of ICAM-1 with an arginine at position 241 is known to be a predisposing factor for the development of CeD in adulthood [50]. Cbp A has also been shown to bind the secretory component of immunoglobulin A, the complement protein C3 [51], and recently facilitated invasion into and through nasopharyngeal epithelial cells by interacting with the human polymeric immunoglobulin receptor [52]. This receptor is expressed by mucosal epithelial cells binding polymeric immunoglobulins and releasing them as secretory antibodies in the mucosa [53]. In this direction, it was reported that intestinal transport of intact 33-mer and p31-49 was blocked by polymeric and secretory IgA (SIgA) and by soluble CD71 receptors, pointing to a role of SIgA-gliadin complexes in this abnormal intestinal transport [54].

The Hit#5 from Nostoc sp. PCC 7524 is predicted to be a cation/multidrug RND (Resistance-Nodulation-Division) efflux pump with transporter activity and belongs to the AcrB/D/F protein family (Table 1 (A): Hit#5, Figure 4A). AcrB/D/F proteins are integral membrane proteins [55]. Some are involved in multidrug resistance [55]. For example, AcrB cooperates with the membrane fusion protein AcrA and an outer membrane channel TolC and forms homotrimers [55].

2.2.3. The Homology Models of Pathogen-Related Proteins Containing the p31-43 Similar Sequence and Their Function

Three MinD/ParA family proteins from Streptomyces sp. were identified (Table 1 (B): Hits#1–3). With an overall similar ID profile for all three proteins and high content of sequence disorder, their tridimensional structure could not be modeled (Figure 5A–C). The identified MinD/ParA family proteins contain an FlhG domain characterized through a conserved nucleotide phosphate-binding motif and involved in diverse cellular functions such as chromosome partitioning or flagellar assembly [56]. ParA and MinD are members of a larger family of ATPases, defined by the specific variant of their Walker A sequences (KGGXXK[ST]) [57]. In general, Par is a transportation system for DNA [58]. Indeed, the partition ATPase (ParA) provides energy for DNA transportation and is related to other ATPases that localize components within a bacterial cell [58].

Regarding its function as a DNA-transport system, the protein possesses non-specific DNA binding activity. It binds DNA with a patch of basic amino acids located at the C-terminus [59,60,61]. However, the membrane-associated ATPase MinD is required for localizing the Escherichia coli cell division machinery at mid-cell by acting as cell division inhibitor and activating MinC [62]. Interaction of MinD with MinC was predicted to be located at the residues around the nucleotide-binding site in MinD [63]. The positioning of MinD/ParA proteins is either due to self-organization on a surface or reliance on a landmark protein that functions as a molecular beacon [64].

Finally, the hypothetical calcium-binding protein from L. edodes (Table 1 (B): Hit#4) shows a low ID score except for a large disordered C-terminal region (Figure 5D). Thus, the hypothetical protein was modeled with high accuracy for more than 95% of the amino acids (Figure 5D).

2.3. Primary Structure Analysis and Potential Functions

2.3.1. CeD-T-Cell Epitopes

The molecular mimicry hypothesis considers that pathogens and/or pathogen-derived proteins/peptides sharing molecular and structural similarities with gluten fragments trigger a strong humoral immunological response. After the primary disease is treated, the host immune system recognizes gluten pathogenic fragments as if they are signatures of the bacterial pathogens [65,66].

Molecular mimicry is nowadays recognized as a pathogenic mechanism by which infectious or chemical agents may induce autoimmunity and occurs when foreign and self-peptides share similarities at the sequence or structural level, favoring the activation of autoreactive T or B cells in predisposed individuals [67]. There is extensive documentation of autoimmune diseases that have been associated by molecular mimicry with foreign pathogens such as acute gastroenteritis by rotavirus [68], autoimmune thyroid diseases by Helicobacter pylori and Hepatitis C virus [69], systemic lupus erythematosus by Leishmania sp. [70], acute rheumatic fever by group A streptococci [71,72] and recently the SARS-CoV-2 coronavirus, the cause of the worldwide COVID-19 pandemic disease, with the Guillain-Barré syndrome [73].

In this context, the 33-mer sequence is responsible for the adaptive immune response in CeD because it contains six partially overlapping copies of canonical T-cell epitopes [14]: three copies of the DQ2.5-glia-α1- (PF/YPQPQLPY) and the DQ2.5-glia-α2 (PQPQLPYPQ) epitope [74]. Thus, one of the most relevant findings is the presence of two partially overlapping copies of the PYGQPQAPY motif in the PQQ-repeat protein of S. viridochromogenes (Table 1 (A): Hit#1), which is similar to the DQ2.5-glia-α1 epitope (Figure 3A). The two mismatches are P to G at position 3 and L to A at position 7 (Figure 3A). Given more than 70 variants found in the canonical sequence of Triticum species [74] and considering the conservative physicochemical properties and size of the replacements, the identified sequence could be a potential immunodominant T-cell DQ2 epitope. Interestingly, the RND efflux pump from Nostoc sp. contains three partially overlapping copies of the PNPQSPXP sequence where X is I or V (Figure 5). Furthermore, there are two or three Q to E variations in two pathogens-related proteins, the Efflux RND transporter permease and F/YSIRK-type protein (Table 1 (A): Hits#5 and #3; Figure 4). The introduction of a negative charge through deamidation leads to a higher binding affinity of the 33-mer gliadin sequence to DQ2/8 MHC class-II molecules on the antigen-presenting cell membrane [75], increasing the immunogenicity of this sequence. Even more, the deamidated variant of the 33-mer peptide is known to be an extremely potent T-cell stimulator in comparison with other gluten-derived peptides [76,77]. The heterogeneity of gluten protein sequences makes it challenging to elucidate the entire repertoire of potential peptides involved in the pathogenesis of CeD [78], and these sequences need to be experimentally investigated.

2.3.2. SH3/WW Domains Binders

Although the functions of the similarity regions of the pathogen proteins are unknown, their localization and polyproline II (PPII) propensity (due to the high content of proline and glutamine amino acids) suggest that they could be potential targets in protein-protein interaction with SH3 domains. Many proteins of the Src kinases family carry small modules named Src Homology 3 (SH3) domains with a characteristic β-barrel fold, which usually enables the binding to proline-rich sequences with a PPII conformation [79]. The SH3 domains are found ubiquitously in all eukaryotes, some prokaryotes, and even viruses [79]. With more than 300 SH3 domains encoded in the human genome, these are crucial elements in protein-protein interactions in several signal transduction pathways [80].

In addition, the WW domain mediates specific protein-protein interactions with short proline-rich or proline-containing motifs [81]. The SH3- and WW domains usually bind to PxxP and xPPx motifs, respectively. In this regard, both the 33-mer and p31-43 peptides could be SH3-binding partners due to their PQLP and PQQP sequences, respectively. The 33-mer similarity sequences for the PxxP SH3-binding motif are found in the form of PEKP (Granulicatella sp.); PEKP, PAQP, and PSTP (S. pneumoniae); and PSSP, PINP, PFIP, PKIP, PQSP, and PKLP (Nostoc sp.) who are predicted by Protter web server [82] to be extracellular (Figure 5). Meanwhile, PQSP and PRSP (Fischerella sp.) and PQAP (S. viridochromogenes) would be located intracellular (Figure 6). On the other hand, only the p31-43 and its similar pathogen sequences have the Y/FPPQ sequence predicted to be in the cytoplasm and with the potential capability to bind the WW domain (Figure 7) [83,84]. Thus, both sequences found in gliadin and the pathogen-related proteins may potentially bind to a plethora of proteins in vivo, which could lead to several unknown metabolic (dis-)functions.

2.4. Gliadin Peptide Superstructures, Pathogen Morphology, and Dysbiosis as Triggers of the Innate Immune Response: The Hypothesis

The significance of the term pathogen as “things” capable of causing human diseases was historically associated with microorganisms. Nevertheless, Griffith’s proposed the pathogenic role of Prions proteins in scrapie in 1967, and the seminal work of Prusiner and coworkers in 1982 laid the groundwork to reconsidering the meaning of the term. Nowadays, the term pathogen was replaced by the widely accepted infectious agent, which includes biomolecules such as proteins. However, as suggested by Methot and Alizon, a more complex scenario needs to be considered as an ecological, evolutionary, and immunological context takes a prominent role in the host-pathogen interaction [85].

In this broad scenario, it could also happen that gluten peptides that share structural/morphological similarities with pathogens have latent pathogenicity and, although initially innocuous to the host, after their accumulation and oligomerization with the conformational transition toward amyloid structures, start to be recognized by the host innate immune as non-replicating pathogens. Interestingly, all the identified host pathogens in the BLASTp search share a rod-like morphology that is linearly organized. It is very similar to the morphology of the 33-mer superstructures and, to some extent, to p31-43 oligomers (Figure 8).

Several studies in vitro and in vivo agreed that particles in the size range of 40 nm to 10 μm are the most immunologically active [86,87,88], which fits well with the range of the 33-mer oligomers (10 nm to more than 1 µm). Paul et al. demonstrated that target shape plays a more prominent role than size in the phagocytosis process [89]. The rod-like morphology of gliadin peptides and their capability to form larger superstructures may be sufficient to generate an early immune response and might serve as general disease-causing signals.

As shown previously in macrophages, only the presence of the larger 33-mer structures activates TLR4 and TLR2 [21]. Additionally, the p31-43 oligomers are involved in the inflammasome activation in murine models [26]. Under cumulative conditions due to their inadequate proteolysis, the morphology of the peptide oligomers and particularly the formation of 2D and 3D nano- and microstructures could trigger an early innate response. Unluckily, the presence of canonical T-cell epitopes in the 33-mer sequence triggers the adaptive immune response in CeD patients.

Additionally, it is reported that gliadin acts as a modulator of human microbiota [27], and it can not be discarded that changes in the microbiota due to the presence of gluten participate in GRDs [28,29]. The gliadin superstructures’ presence could also interfere negatively in the initial attachment and subsequent colonization of beneficial bacteria. Another possibility could be that the morphological similarities with pathogenic bacteria favor their attachment and colonization in the mucosa. Notably, both scenarios would lead to dysbiosis, an imbalance of the normal microbiota of the gut [94]. Forsberg et al. showed the first demonstration of morphologically uniform bacteria associated with the mucosa of CeD patients [95], although the presence of bacteria has been implicated in CeD earlier [96,97]. The authors classified bacteria as rod-shaped if the length/diameter ratio was >1.5 and as cocci form if the ratio was ≤1.5 [95]. Bacteria are commonly seen in the intestinal mucosa of patients with active CeD [98]. They are mainly rod-shaped and seemed to adhere to one end between or on the microvilli of the epithelial cells and often appeared in bouquet-like groups [98]. Furthermore, coccoid bacteria were also detected. Moreover, the bacteria either covered the entire surface or were present in large patches covering most of the surface and the lesions observed in CeD patients. Visually, each sample contained bacteria of only one phenotype [95]. Further, in patients with inflammatory bowel disease, a disease sharing several parallels with CeD, attachment of bacteria to the intestinal epithelium was demonstrated [99]. In this regard, dysbiosis is observed in young CeD patients [100]. A recent study has shed light on the interaction between host genetics and dysbiosis with CeD development [101]. Even more, an increase in rod-shaped bacteria in the proximal small intestine microbiota, dominated by Streptococcus, Staphylococcus, and Actinobacteria, was found to contribute to the development of CeD in young patients [102]. Then, is CeD a risk factor for dysbiosis, or is dysbiosis a risk factor for CeD?

Finally, young CeD patients have unique carbohydrate structures of the glycocalyx/mucous layer, facilitating bacterial adhesion in the proximal small intestine [95]. Although, it was not possible to distinguish between the possibilities that altered glycosylation is an inherited property of individuals who contract CeD or that bacteria influence glycosylation. In this regard, all target proteins show several N-glycosylation motifs (Figure 6 and Figure 7), which reinforce the idea of a potential role as a foreign trigger of CeD. This observation is another example of a relationship between specific pathogens and CeD pathogenesis because pathogen presence could indicate aberrant innate immunity. In these situations, prolamin may irritate the epithelial surface and even be mistaken for a pathogen [95].

Besides the high sequence, structural and morphological similarities reported here, the overall disease course caused by gluten-derived peptides, infection agents, or pathogens share remarkably similar characteristics [6]. First, gluten peptides and pathogens can both enter the intestinal lumen through oral intake. Exposure to most pathogens and gluten is necessary but not sufficient to cause disease because the genetic background and susceptibility, and dose effects, are additional determinants for disease progression [103]. For example, gluten peptides and Granulicatella sp. are ubiquitous but only cause disease in susceptible individuals. In this point of view, the flora components alter the homeostasis of the immune system, reshaping the immune environment and promoting the development of specific diseases such as CeD. Therefore, the default setup of intestinal immunity is the generation of tolerance unless specific signals evoke inflammatory reactions. The antigens present in the gut lumen are constantly sampled by intestinal dendritic cells and presented to the T cells in either Peyer’s patches or mesenteric lymph nodes, which results in the generation of regulatory CD4⁺ T cells [104]. Moreover, high concentrations of anti-inflammatory cytokines such as IL-10 and TGF-β are usually found in the intestine of celiac patients [105]. Whereas, speaking about GRDs or bacterial infections, only after their complete elimination from the host body after infection results in remission, and reintroduction causes recurrence of symptoms and/or disease [106].

3. Materials and Methods

The α-2-gliadin sequence from Triticum aestivum was retrieved from the UniProt database under the accession number Q9M4L6 to obtain the sequence of the 33-mer (57–89 region) and p31-43 (31–43 region) peptides. To identify proteins that share high sequence similarity with the two mentioned relevant celiac peptides, a protein-protein BLAST [30] and an optional SmartBlast search were performed using a BLOSUM62 matrix with gap cost (existence 11 and extension 1). This search was performed using non-redundant protein sequences with a word size of 6 amino acids. Only top-scored hits outside of the Gramineae family (taxonomy ID 4479) were used for further analysis.

Protein sequences obtained by BLASTp search, designated as target proteins, were subjected to predicting the natively disordered regions using PrDOS server version 2.0 [107], setting a false positive rate of 5%. After careful manual inspection of those target proteins or related proteins with an experimentally solved structure, respective amino acid sequences were subjected to structural modeling using the protein homology/analogy recognition engine (PHYRE) server version 2.0 [108] in the intensive mode. Only those structure models for which more than 80% of the sequence could be modeled and yielded more than 90% of confidence were selected for further analysis. The models were then subjected to an energy minimization to remove any clashes using a standard protocol in the Amber14 [109] suite as we previously described [19]. The percentage of solvent-accessible surface area (SASA) was calculated using the vmdICE [110] plug-in for VMD 1.9.3 [111] using a water probe radii of 1.4 Å and normalized by the value of the corresponding free amino acid in a GXG peptide context. The structure-based prediction of aggregation hot-spots was performed using AGGRESCAN 3D [112] in the static mode setting a distance cutoff of 10 Å. All protein representations were prepared using VMD 1.9.3 [111]. Subcellular localization prediction was performed with Protter version 1.0 [82].

4. Conclusions

Comparing both sets of pathogen-derived proteins presented in Table 1, the p31-43 similarity regions are located mainly in the C-terminal part of the target proteins; meanwhile, the respective 33-mer similarity regions are randomly distributed across the target proteins (Figure 3, Figure 4 and Figure 5). The majority of the similarity regions were identified to be located in disordered or partially disordered regions and solvent-exposed; however, they are located in areas not prone to aggregation in the bacterial proteins (Figure 3, Figure 4 and Figure 5). Interestingly, three out of five of the 33-mer similarity regions are predicted to be located in the extracellular space (Table 1 (A): Hits#3–5, Figure 6). In contrast, the p31-43 similarity regions are predicted to be located intracellular (Figure 7). From the primary structure, gluten peptides and their respective pathogens-related proteins possess PxxP sequences capable of binding predominantly to the SH3 domain and the xPPx (p31-43) motif that binds to WW domains. Therefore, these sequences may potentially bind to a plethora of proteins in vivo, leading to novel metabolic (dis-)functions. Pathological overlaps between the protein and 33-mer peptide, e.g., induction of an immune response, were found for the non-covalent extracellular cbpA from S. pneumoniae, which function is to increase the expression of ICAM-1, an early inflammatory maker. In this regard, a specific variant of ICAM-1 with an arginine at position 241 is a predisposing factor for the development of CeD in adulthood. Another significant finding is Granulicatella sp., which is found in the gut and reported in the case of CeD, and the corresponding pathogen-related protein has potential celiac disease T-cell recognition motifs. The molecular and structural similarities with Granulicatella sp. points out the necessity to investigate the role of these pathogens in the development of CeD by molecular mimicry mechanisms.

Finally, due to similarities between the size and shape of the pathogens to the 33-mer supramolecular structure, we hypothesize that the gliadin peptide capability of forming rod-like and biofilm-similar structures may be connected to the dysbiosis observed in active CeD and the other less understood GRDs. Therefore, it is hypothesized that the 3D patterns formed by the gliadin peptides could play a role in selective bacterial attachment and colonization that can shift the gut homeostasis toward dysbiosis. In addition, the formation of gliadin superstructures could be a pathological trigger that activates the innate immune system, which depending on individual susceptibility, would lead to CeD or the different GRDs. In summary, our findings stress the importance of further experimental research in the field of gut microbiota, particularly with their connection to CeD and other GRDs, with the final aim to improve the health and life quality of gluten-susceptible people.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/ijms22179278/s1.

Author Contributions

Conceptualization, Methodology, Formal Analysis, Investigation, Writing—Review and Editing, D.S.V. and H.M.S.; Conceptualization, Funding, Supervision, Investigation, Writing—Review and Editing, V.I.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Deutsche Forschungsgemeinschaft Grant (DFG, Grant number 430578458) and the BMBF-funded de.NBI Cloud within the German Network for Bioinformatics Infrastructure (de.NBI) (Grant number 031A532B, 031A533A, 031A533B, 031A534A, 031A535A, 031A537A, 031A537B, 031A537C, 031A537D, 031A538A). We acknowledge support for the publication costs by the Open Access Publication Fund of Bielefeld University.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

V.I.D. and H.M.S. thank Karsten Niehaus for his support to start this work. D.S.V. is a member of the Scientific and Technological Researcher Career of CONICET. All authors thank Fernando Chirdo for critically reviewing the first draft of this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

CeD	celiac disease
ID	intrinsic disorder
MHC	major histocompatibility complex
PPII	type-II polyproline
SASA	solvent-accessible surface area

References

Cabanillas, B. Gluten-related disorders: Celiac disease, wheat allergy, and nonceliac gluten sensitivity. Crit. Rev. Food Sci. Nutr. 2020, 60, 2606–2621. [Google Scholar] [CrossRef]
Taraghikhah, N.; Ashtari, S.; Asri, N.; Shahbazkhani, B.; Al-Dulaimi, D.; Rostami-Nejad, M.; Rezaei-Tavirani, M.; Razzaghi, M.R.; Zali, M.R. An updated overview of spectrum of gluten-related disorders: Clinical and diagnostic aspects. BMC Gastroenterol. 2020, 20, 258. [Google Scholar] [CrossRef]
Di Sabatino, A.; Corazza, G.R. Coeliac disease. Lancet 2009, 373, 1480–1493. [Google Scholar] [CrossRef]
Shan, L.; Molberg, Ø.; Parrot, I.; Hausch, F.; Filiz, F.; Gray, G.M.; Sollid, L.M.; Khosla, C. Structural Basis for Gluten Intolerance in Celiac Sprue. Science 2002, 297, 2275–2279. [Google Scholar] [CrossRef]
Petersen, J.; Ciacchi, L.; Tran, M.T.; Loh, K.L.; Kooy-Winkelaar, Y.; Croft, N.P.; Hardy, M.I.; Chen, Z.; McCluskey, J.; Anderson, R.P.; et al. T cell receptor cross-reactivity between gliadin and bacterial peptides in celiac disease. Nat. Struct. Mol. Biol. 2020, 27, 49–61. [Google Scholar] [CrossRef] [PubMed]
Bethune, M.T.; Khosla, C. Parallels between Pathogens and Gluten Peptides in Celiac Sprue. PLoS Pathog. 2008, 4, e34. [Google Scholar] [CrossRef]
Verdu, E.F.; Schuppan, D. The enemy within the gut: Bacterial pathogens in celiac autoimmunity. Nat. Struct. Mol. Biol. 2019, 27, 5–7. [Google Scholar] [CrossRef] [PubMed]
Lammers, K.M.; Herrera, M.G.; Dodero, V.I. Translational Chemistry Meets Gluten-Related Disorders. ChemistryOpen 2018, 7, 217–232. [Google Scholar] [CrossRef] [PubMed]
Doshi, N.; Mitragotri, S. Macrophages Recognize Size and Shape of Their Targets. PLoS ONE 2010, 5, e10051. [Google Scholar] [CrossRef]
Swartzwelter, B.; Fux, A.; Johnson, L.; Swart, E.; Hofer, S.; Hofstätter, N.; Geppert, M.; Italiani, P.; Boraschi, D.; Duschl, A.; et al. The Impact of Nanoparticles on Innate Immune Activation by Live Bacteria. Int. J. Mol. Sci. 2020, 21, 9695. [Google Scholar] [CrossRef]
Herrera, M.G.; Nicoletti, F.; Gras, M.; Dörfler, P.W.; Tonali, N.; Hannappel, Y.; Ennen, I.; Hütten, A.; Hellweg, T.; Lammers, K.M.; et al. Pepsin Digest of Gliadin Forms Spontaneously Amyloid-Like Nanostructures Influencing the Expression of Selected Pro-Inflammatory, Chemoattractant, and Apoptotic Genes in Caco-2 Cells: Implications for Gluten-Related Disorders. Mol. Nutr. Food Res. 2021, 65, 2100200. [Google Scholar] [CrossRef] [PubMed]
Maiuri, L.; Ciacci, C.; Ricciardelli, I.; Vacca, L.; Raia, V.; Auricchio, S.; Picard, J.; Osman, M.; Quaratino, S.; Londei, M. Association between innate response to gliadin and activation of pathogenic T cells in coeliac disease. Lancet 2003, 362, 30–37. [Google Scholar] [CrossRef]
Londei, M.; Ciacci, C.; Ricciardelli, I.; Vacca, L.; Quaratino, S.; Maiuri, L. Gliadin as a stimulator of innate responses in celiac disease. Mol. Immunol. 2005, 42, 913–918. [Google Scholar] [CrossRef] [PubMed]
Qiao, S.-W.; Bergseng, E.; Molberg, Ø.; Xia, J.; Fleckenstein, B.; Khosla, C.; Sollid, L.M. Antigen Presentation to Celiac Lesion-Derived T Cells of a 33-Mer Gliadin Peptide Naturally Formed by Gastrointestinal Digestion. J. Immunol. 2004, 173, 1757–1762. [Google Scholar] [CrossRef]
Fraser, J.S.; Engel, W.; Ellis, H.J.; Moodie, S.J.; Pollock, E.L.; Wieser, H.; Ciclitira, P.J. Coeliac disease: In vivo toxicity of the putative immunodominant epitope. Gut 2003, 52, 1698–1702. [Google Scholar] [CrossRef]
Sollid, L.M. Intraepithelial Lymphocytes in Celiac Disease: License to Kill Revealed. Immunity 2004, 21, 303–304. [Google Scholar] [CrossRef]
Herrera, M.G.; Zamarreño, F.; Costabel, M.; Ritacco, H.; Hütten, A.; Sewald, N.; Dodero, V.I. Circular dichroism and electron microscopy studies in vitro of 33-mer gliadin peptide revealed secondary structure transition and supramolecular organization. Biopolymers 2014, 101, 96–106. [Google Scholar] [CrossRef]
Herrera, M.G.; Benedini, L.; Lonez, C.; Schilardi, P.L.; Hellweg, T.; Ruysschaert, J.-M.; Dodero, V.I. Self-assembly of 33-mer gliadin peptide oligomers. Soft Matter 2015, 11, 8648–8660. [Google Scholar] [CrossRef]
Herrera, M.; Vazquez, D.; Sreij, R.; Drechsler, M.; Hertle, Y.; Hellweg, T.; Dodero, V. Insights into gliadin supramolecular organization at digestive pH 3.0. Colloids Surf. B Biointerfaces 2018, 165, 363–370. [Google Scholar] [CrossRef]
Herrera, M.G.; Veuthey, T.V.; Dodero, V.I. Self-organization of gliadin in aqueous media under physiological digestive pHs. Colloids Surf. B Biointerfaces 2016, 141, 565–575. [Google Scholar] [CrossRef] [PubMed]
Herrera, M.G.; Pizzuto, M.; Lonez, C.; Rott, K.; Hütten, A.; Sewald, N.; Ruysschaert, J.-M.; Dodero, V.I. Large supramolecular structures of 33-mer gliadin peptide activate toll-like receptors in macrophages. Nanomed. Nanotechnol. Biol. Med. 2018, 14, 1417–1427. [Google Scholar] [CrossRef]
Amundarain, M.J.; Herrera, M.G.; Zamarreño, F.; Viso, J.F.; Costabel, M.D.; Dodero, V.I. Molecular mechanisms of 33-mer gliadin peptide oligomerisation. Phys. Chem. Chem. Phys. 2019, 21, 22539–22552. [Google Scholar] [CrossRef] [PubMed]
Falcigno, L.; Calvanese, L.; Conte, M.; Nanayakkara, M.; Barone, M.V.; D’Auria, G. Structural Perspective of Gliadin Peptides Active in Celiac Disease. Int. J. Mol. Sci. 2020, 21, 9301. [Google Scholar] [CrossRef]
Nanayakkara, M.; Lania, G.; Maglio, M.; Auricchio, R.; De Musis, C.; Discepolo, V.; Miele, E.; Jabri, B.; Troncone, R.; Auricchio, S.; et al. P31–43, an undigested gliadin peptide, mimics and enhances the innate immune response to viruses and interferes with endocytic trafficking: A role in celiac disease. Sci. Rep. 2018, 8, 1–12. [Google Scholar] [CrossRef]
Herrera, M.G.; Castro, M.F.G.; Prieto, E.; Barrera, E.; Dodero, V.I.; Pantano, S.; Chirdo, F. Structural conformation and self-assembly process of p31-43 gliadin peptide in aqueous solution. Implications for celiac disease. FEBS J. 2019, 287, 2134–2149. [Google Scholar] [CrossRef]
Castro, M.F.G.; Miculán, E.; Herrera, M.G.; Ruera, C.; Perez, F.; Prieto, E.D.; Barrera, E.; Pantano, S.; Carasi, P.; Chirdo, F.G. p31-43 Gliadin Peptide Forms Oligomers and Induces NLRP3 Inflammasome/Caspase 1- Dependent Mucosal Damage in Small Intestine. Front. Immunol. 2019, 10, 31. [Google Scholar] [CrossRef]
Bascuñán, K.A.; Araya, M.; Roncoroni, L.; Doneda, L.; Elli, L. Dietary Gluten as a Conditioning Factor of the Gut Microbiota in Celiac Disease. Adv. Nutr. 2019, 11, 160–174. [Google Scholar] [CrossRef] [PubMed]
Akobeng, A.K.; Singh, P.; Kumar, M.; Al Khodor, S. Role of the gut microbiota in the pathogenesis of coeliac disease and potential therapeutic implications. Eur. J. Nutr. 2020, 59, 3369–3390. [Google Scholar] [CrossRef]
Polo, A.; Arora, K.; Ameur, H.; Di Cagno, R.; De Angelis, M.; Gobbetti, M. Gluten-free diet and gut microbiome. J. Cereal Sci. 2020, 95, 103058. [Google Scholar] [CrossRef]
Altschul, S.F.; Gish, W.; Miller, W.; Myers, E.W.; Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 1990, 215, 403–410. [Google Scholar] [CrossRef]
Weitnauer, G.; Mühlenweg, A.; Trefzer, A.; Hoffmeister, D.; Süßmuth, R.; Jung, G.; Welzel, K.; Vente, A.; Girreser, U.; Bechthold, A. Biosynthesis of the orthosomycin antibiotic avilamycin A: Deductions from the molecular analysis of the avi biosynthetic gene cluster of Streptomyces viridochromogenes Tü57 and production of new antibiotics. Chem. Biol. 2001, 8, 569–581. [Google Scholar] [CrossRef]
Viola, R.E. The Central Enzymes of the Aspartate Family of Amino Acid Biosynthesis. Accounts Chem. Res. 2001, 34, 339–349. [Google Scholar] [CrossRef] [PubMed]
Cargill, J.S.; Scott, K.S.; Gascoyne-Binzi, D.; Sandoe, J.A.T. Granulicatella infection: Diagnosis and management. J. Med. Microbiol. 2012, 61, 755–761. [Google Scholar] [CrossRef]
Brooks, L.R.K.; Mias, G.I. Streptococcus pneumoniae’s Virulence and Host Immunity: Aging, Diagnostics, and Prevention. Front. Immunol. 2018, 9, 1366. [Google Scholar] [CrossRef]
Nowruzi, B.; Khavari-Nejad, R.-A.; Sivonen, K.; Kazemi, B.; Najafi, F.; Nejadsattari, T. Identification and toxigenic potential of a Nostoc sp. ALGAE 2012, 27, 303–313. [Google Scholar] [CrossRef]
McNally, A.; Ross, C.; Wayte, J. Shiitake dermatitis: The tale of an under-recognised, undercooked fungus. Med. J. Aust. 2016, 204, 124–126. [Google Scholar] [CrossRef] [PubMed]
Urade, R.; Sato, N.; Sugiyama, M. Gliadins from wheat grain: An overview, from primary structure to nanostructures of aggregates. Biophys. Rev. 2018, 10, 435–443. [Google Scholar] [CrossRef]
Hausch, F.; Shan, L.; Santiago, N.A.; Gray, G.M.; Khosla, C. Intestinal digestive resistance of immunodominant gliadin peptides. Am. J. Physiol. Liver Physiol. 2002, 283, G996–G1003. [Google Scholar] [CrossRef]
Anderson, O.D.; Dong, L.; Huo, N.; Gu, Y.Q. A New Class of Wheat Gliadin Genes and Proteins. PLoS ONE 2012, 7, e52139. [Google Scholar] [CrossRef]
Misra, H.S.; Rajpurohit, Y.S.; Khairnar, N.P. Pyrroloquinoline-quinone and its versatile roles in biological processes. J. Biosci. 2012, 37, 313–325. [Google Scholar] [CrossRef] [PubMed]
Harris, C.B.; Chowanadisai, W.; Mishchuk, D.O.; Satre, M.A.; Slupsky, C.M.; Rucker, R.B. Dietary pyrroloquinoline quinone (PQQ) alters indicators of inflammation and mitochondrial-related metabolism in human subjects. J. Nutr. Biochem. 2013, 24, 2076–2084. [Google Scholar] [CrossRef] [PubMed]
Bae, T.; Schneewind, O. The YSIRK-G/S Motif of Staphylococcal Protein A and Its Role in Efficiency of Signal Peptide Processing. J. Bacteriol. 2003, 185, 2910–2919. [Google Scholar] [CrossRef]
Spirig, T.; Weiner, E.M.; Clubb, R.T. Sortase enzymes in Gram-positive bacteria. Mol. Microbiol. 2011, 82, 1044–1059. [Google Scholar] [CrossRef]
Schneewind, O.; Missiakas, D. Sortases, Surface Proteins, and Their Roles in Staphylococcus aureus Disease and Vaccine Development. Microbiol. Spectr. 2019, 7. [Google Scholar] [CrossRef]
Cossart, P.; Jonquières, R. Sortase, a universal target for therapeutic agents against Gram-positive bacteria? Proc. Natl. Acad. Sci. USA 2000, 97, 5013–5015. [Google Scholar] [CrossRef]
Gosink, K.K.; Mann, E.R.; Guglielmo, C.; Tuomanen, E.I.; Masure, H.R. Role of Novel Choline Binding Proteins in Virulence of Streptococcus pneumoniae. Infect. Immun. 2000, 68, 5690–5695. [Google Scholar] [CrossRef] [PubMed]
Murdoch, C.; Read, R.; Zhang, Q.; Finn, A. Choline-Binding Protein A ofStreptococcus pneumoniaeElicits Chemokine Production and Expression of Intercellular Adhesion Molecule 1 (CD54) by Human Alveolar Epithelial Cells. J. Infect. Dis. 2002, 186, 1253–1260. [Google Scholar] [CrossRef] [PubMed]
Jelínková, L.; Tučková, L.; Cinová, J.; Flegelová, Z.; Tlaskalová-Hogenová, H. Gliadin stimulates human monocytes to production of IL-8 and TNF-α through a mechanism involving NF-κB. FEBS Lett. 2004, 571, 81–85. [Google Scholar] [CrossRef]
Jelínková, L.; Tučková, L.; Sánchez, D.; Krupičková, S.; Pozler, O.; Nevoral, J.; Kotalová, R.; Tlaskalová-Hogenová, H. Increased levels of circulating ICAM-1, E-selectin, and IL-2 receptors in celiac disease. Dig. Dis. Sci. 2000, 45, 398–402. [Google Scholar] [CrossRef]
Abel, M.; Cellier, C.; Kumar, N.; Cerf-Bensussan, N.; Schmitz, J.; Caillat-Zucman, S. Adulthood-Onset Celiac Disease Is Associated with Intercellular Adhesion Molecule-1 (ICAM-1) Gene Polymorphism. Hum. Immunol. 2006, 67, 612–617. [Google Scholar] [CrossRef]
Hammerschmidt, S.; Talay, S.R.; Brandtzaeg, P.; Chhatwal, G.S. SpsA, a novel pneumococcal surface protein with specific binding to secretory Immunoglobulin A and secretory component. Mol. Microbiol. 1997, 25, 1113–1124. [Google Scholar] [CrossRef]
Zhang, J.-R.; E Mostov, K.; E Lamm, M.; Nanno, M.; Shimida, S.-I.; Ohwaki, M.; Tuomanen, E. The Polymeric Immunoglobulin Receptor Translocates Pneumococci across Human Nasopharyngeal Epithelial Cells. Cell 2000, 102, 827–837. [Google Scholar] [CrossRef]
Maestro, B.; Sanz, J.M. Choline Binding Proteins from Streptococcus pneumoniae: A Dual Role as Enzybiotics and Targets for the Design of New Antimicrobials. Antibiotics 2016, 5, 21. [Google Scholar] [CrossRef] [PubMed]
Matysiak-Budnik, T.; Moura, I.C.; Arcos-Fajardo, M.; Lebreton, C.; Menard, S.; Candalh, C.; Ben-Khalifa, K.; Dugave, C.; Tamouza, H.; Van Niel, G.; et al. Secretory IgA mediates retrotranscytosis of intact gliadin peptides via the transferrin receptor in celiac disease. J. Exp. Med. 2007, 205, 143–154. [Google Scholar] [CrossRef]
Murakami, S. Multidrug efflux transporter, AcrB—the pumping mechanism. Curr. Opin. Struct. Biol. 2008, 18, 459–465. [Google Scholar] [CrossRef] [PubMed]
Ramm, B.; Heermann, T.; Schwille, P. The E. coli MinCDE system in the regulation of protein patterns and gradients. Cell. Mol. Life Sci. 2019, 76, 4245–4273. [Google Scholar] [CrossRef]
Koonin, E.V. A Superfamily of ATPases with Diverse Functions Containing Either Classical or Deviant ATP-binding Motif. J. Mol. Biol. 1993, 229, 1165–1174. [Google Scholar] [CrossRef]
Vecchiarelli, A.G.; Mizuuchi, K.; Funnell, B.E. Surfing biological surfaces: Exploiting the nucleoid for partition and transport in bacteria. Mol. Microbiol. 2012, 86, 513–523. [Google Scholar] [CrossRef]
Hester, C.M.; Lutkenhaus, J. Soj (ParA) DNA binding is mediated by conserved arginines and is essential for plasmid segregation. Proc. Natl. Acad. Sci. USA 2007, 104, 20326–20331. [Google Scholar] [CrossRef] [PubMed]
Castaing, J.-P.; Bouet, J.-Y.; Lane, D. F plasmid partition depends on interaction of SopA with non-specific DNA. Mol. Microbiol. 2008, 70, 1000–1011. [Google Scholar] [CrossRef] [PubMed]
Soberón, N.E.; Lioy, V.; Pratto, F.; Volante, A.; Alonso, J.C. Molecular anatomy of the Streptococcus pyogenes pSM19035 partition and segrosome complexes. Nucleic Acids Res. 2010, 39, 2624–2637. [Google Scholar] [CrossRef]
Hayashi, I.; Oyama, T.; Morikawa, K. Structural and functional studies of MinD ATPase: Implications for the molecular recognition of the bacterial cell division apparatus. EMBO J. 2001, 20, 1819–1828. [Google Scholar] [CrossRef]
Zhou, H.; Lutkenhaus, J. MinC Mutants Deficient in MinD- and DicB-Mediated Cell Division Inhibition Due to Loss of Interaction with MinD, DicB, or a Septal Component. J. Bacteriol. 2005, 187, 2846–2857. [Google Scholar] [CrossRef]
Lutkenhaus, J. The ParA/MinD family puts things in their place. Trends Microbiol. 2012, 20, 411–418. [Google Scholar] [CrossRef]
Oldstone, M.B.A. Molecular mimicry and immune-mediated diseases. FASEB J. 1998, 12, 1255–1265. [Google Scholar] [CrossRef]
Kohm, A.P.; Fuller, K.G.; Miller, S.D. Mimicking the way to autoimmunity: An evolving theory of sequence and structural homology. Trends Microbiol. 2003, 11, 101–105. [Google Scholar] [CrossRef]
Rojas, M.; Restrepo, P.; Monsalve, D.M.; Pacheco, Y.; Acosta-Ampudia, Y.; Ramírez-Santana, C.; Leung, P.S.; Ansari, A.A.; Gershwin, M.E.; Anaya, J.-M. Molecular mimicry and autoimmunity. J. Autoimmun. 2018, 95, 100–123. [Google Scholar] [CrossRef] [PubMed]
Gómez-Rial, J.; Calle, I.R.; Salas, A.; Martinón-Torres, F. Rotavirus and autoimmunity. J. Infect. 2020, 81, 183–189. [Google Scholar] [CrossRef] [PubMed]
Cuan-Baltazar, Y.; Soto-Vega, E. Microorganisms associated to thyroid autoimmunity. Autoimmun. Rev. 2020, 19, 102614. [Google Scholar] [CrossRef] [PubMed]
Múnera, M.; Farak, J.; Pérez, M.; Rojas, J.; Villero, J.; Sánchez, A.; Emiliani, Y. Prediction of molecular mimicry between antigens from Leishmania sp. and human: Implications for autoimmune response in systemic lupus erythematosus. Microb. Pathog. 2020, 148, 104444. [Google Scholar] [CrossRef]
Cunningham, M.W. Streptococcus and rheumatic fever. Curr. Opin. Rheumatol. 2012, 24, 408–416. [Google Scholar] [CrossRef] [PubMed]
Cunningham, M.W. Molecular Mimicry, Autoimmunity, and Infection: The Cross-Reactive Antigens of Group A Streptococci and their Sequelae. Microbiol. Spectr. 2019, 7. [Google Scholar] [CrossRef] [PubMed]
Lucchese, G.; Flöel, A. SARS-CoV-2 and Guillain-Barré syndrome: Molecular mimicry with human heat shock proteins as potential pathogenic mechanism. Cell Stress Chaperones 2020, 25, 731–735. [Google Scholar] [CrossRef]
Ozuna, C.V.; Iehisa, J.C.M.; Gimenez, M.J.; Alvarez, J.B.; Sousa, C.; Barro, F. Diversification of the celiac disease α-gliadin complex in wheat: A 33-mer peptide with six overlapping epitopes, evolved following polyploidization. Plant J. 2015, 82, 794–805. [Google Scholar] [CrossRef]
Molberg, Ø.; Mcadam, S.N.; Körner, R.; Quarsten, H.; Kristiansen, C.; Madsen, L.; Fugger, L.; Scott, H.; Norén, O.; Roepstorff, P.; et al. Tissue transglutaminase selectively modifies gliadin peptides that are recognized by gut-derived T cells in celiac disease. Nat. Med. 1998, 4, 713–717. [Google Scholar] [CrossRef] [PubMed]
Arentz-Hansen, H.; Körner, R.; Molberg, Ø.; Quarsten, H.; Vader, W.; Kooy, Y.M.; Lundin, K.E.; Koning, F.; Roepstorff, P.; Sollid, L.M.; et al. The Intestinal T Cell Response to α-Gliadin in Adult Celiac Disease Is Focused on a Single Deamidated Glutamine Targeted by Tissue Transglutaminase. J. Exp. Med. 2000, 191, 603–612. [Google Scholar] [CrossRef]
Qiao, S.-W.; Bergseng, E.; Molberg, Ø.; Jung, G.; Fleckenstein, B.; Sollid, L.M. Refining the Rules of Gliadin T Cell Epitope Binding to the Disease-Associated DQ2 Molecule in Celiac Disease: Importance of Proline Spacing and Glutamine Deamidation. J. Immunol. 2005, 175, 254–261. [Google Scholar] [CrossRef]
Ruiz-Carnicer, Á.; Comino, I.; Segura, V.; Ozuna, C.V.; Moreno, M.D.L.; López-Casado, M.Á.; Torres, M.I.; Barro, F.; Sousa, C. Celiac Immunogenic Potential of α-Gliadin Epitope Variants from Triticum and Aegilops Species. Nutrients 2019, 11, 220. [Google Scholar] [CrossRef]
Kurochkina, N.; Guha, U. SH3 domains: Modules of protein–protein interactions. Biophys. Rev. 2013, 5, 29–39. [Google Scholar] [CrossRef]
Teyra, J.; Huang, H.; Jain, S.; Guan, X.; Dong, A.; Liu, Y.; Tempel, W.; Min, J.; Tong, Y.; Kim, P.M.; et al. Comprehensive Analysis of the Human SH3 Domain Family Reveals a Wide Variety of Non-canonical Specificities. Struct. 2017, 25, 1598–1610.e3. [Google Scholar] [CrossRef]
Chen, H.I.; Sudol, M. The WW domain of Yes-associated protein binds a proline-rich ligand that differs from the consensus established for Src homology 3-binding modules. Proc. Natl. Acad. Sci. USA 1995, 92, 7819–7823. [Google Scholar] [CrossRef] [PubMed]
Omasits, U.; Ahrens, C.; Müller, S.; Wollscheid, B. Protter: Interactive protein feature visualization and integration with experimental proteomic data. Bioinformatics 2014, 30, 884–886. [Google Scholar] [CrossRef] [PubMed]
Ball, L.J.; Kühne, R.; Schneider-Mergener, J.; Oschkinat, H. Recognition of Proline-Rich Motifs by Protein-Protein-Interaction Domains. Angew. Chem. Int. Ed. 2005, 44, 2852–2869. [Google Scholar] [CrossRef] [PubMed]
Bork, P.; Sudol, M. The WW domain: A signalling site in dystrophin? Trends Biochem. Sci. 1994, 19, 531–533. [Google Scholar] [CrossRef]
Méthot, P.-O.; Alizon, S. What is a pathogen? Toward a process view of host-parasite interactions. Virulence 2014, 5, 775–785. [Google Scholar] [CrossRef]
Ingram, J.H.; Stone, M.; Fisher, J.; Ingham, E. The influence of molecular weight, crosslinking and counterface roughness on TNF-alpha production by macrophages in response to ultra high molecular weight polyethylene particles. Biomaterials 2004, 25, 3511–3522. [Google Scholar] [CrossRef] [PubMed]
Matthews, J.; Green, T.R.; Stone, M.H.; Wroblewski, B.M.; Fisher, J.; Ingham, E. Comparison of the response of primary human peripheral blood mononuclear phagocytes from different donors to challenge with model polyethylene particles of known size and dose. Biomaterials 2000, 21, 2033–2044. [Google Scholar] [CrossRef]
Green, T. Polyethylene particles of a ‘critical size’ are necessary for the induction of cytokines by macrophages in vitro. Biomaterials 1998, 19, 2297–2302. [Google Scholar] [CrossRef]
Paul, D.; Achouri, S.; Yoon, Y.-Z.; Herre, J.; Bryant, C.E.; Cicuta, P. Phagocytosis Dynamics Depends on Target Shape. Biophys. J. 2013, 105, 1143–1150. [Google Scholar] [CrossRef]
Nguyen, H.T.; Pokhrel, A.R.; Nguyen, C.T.; Pham, V.T.T.; Dhakal, D.; Lim, H.N.; Jung, H.J.; Kim, T.-S.; Yamaguchi, T.; Sohng, J.K. Streptomyces sp. VN1, a producer of diverse metabolites including non-natural furan-type anticancer compound. Sci. Rep. 2020, 10, 1756. [Google Scholar] [CrossRef]
Ebadi, M.; Zolfaghari, M.R.; Aghaei, S.S.; Zargar, M.; Shafiei, M.; Zahiri, H.S.; Noghabi, K.A. A bio-inspired strategy for the synthesis of zinc oxide nanoparticles (ZnO NPs) using the cell extract of cyanobacterium Nostoc sp. EA03: From biological function to toxicity evaluation. RSC Adv. 2019, 9, 23508–23525. [Google Scholar] [CrossRef]
Sanchez, C.J.; Kumar, N.; Lizcano, A.; Shivshankar, P.; Hotopp, J.C.D.; Jorgensen, J.H.; Tettelin, H.; Orihuela, C.J. Streptococcus pneumoniae in Biofilms Are Unable to Cause Invasive Disease Due to Altered Virulence Determinant Production. PLoS ONE 2011, 6, e28738. [Google Scholar] [CrossRef]
Karched, M.; Bhardwaj, R.G.; Asikainen, S.E. Coaggregation and biofilm growth of Granulicatella spp. with Fusobacterium nucleatum and Aggregatibacter actinomycetemcomitans. BMC Microbiol. 2015, 15, 114. [Google Scholar] [CrossRef] [PubMed]
Carding, S.; Verbeke, K.; Vipond, D.T.; Corfe, B.M.; Owen, L.J. Dysbiosis of the gut microbiota in disease. Microb. Ecol. Health Dis. 2015, 26, 26191. [Google Scholar] [CrossRef] [PubMed]
Forsberg, G.; Fahlgren, A.; Horstedt, P.; Hammarstrom, S.; Hernell, O.; Hammarstrom, M.-L. Presence of Bacteria and Innate Immunity of Intestinal Epithelium in Childhood Celiac Disease. Am. J. Gastroenterol. 2004, 99, 894–904. [Google Scholar] [CrossRef]
Corazza, G.R.; Strocchi, A.; Gasbarrini, G. Fasting breath hydrogen in celiac disease. Gastroenterology 1987, 93, 53–58. [Google Scholar] [CrossRef]
Tursi, A.; Brandimarte, G.; Giorgetti, G. High prevalence of small intestinal bacterial overgrowth in celiac patients with persistence of gastrointestinal symptoms after gluten withdrawal. Am. J. Gastroenterol. 2003, 98, 839–843. [Google Scholar] [CrossRef]
Valitutti, F.; Cucchiara, S.; Fasano, A. Celiac Disease and the Microbiome. Nutrients 2019, 11, 2403. [Google Scholar] [CrossRef]
Pascual, V. Inflammatory bowel disease and celiac disease: Overlaps and differences. World J. Gastroenterol. 2014, 20, 4846–4856. [Google Scholar] [CrossRef]
Francavilla, R.; Ercolini, D.; Piccolo, M.; Vannini, L.; Siragusa, S.; De Filippis, F.; De Pasquale, I.; Di Cagno, R.; Di Toma, M.; Gozzi, G.; et al. Salivary Microbiota and Metabolome Associated with Celiac Disease. Appl. Environ. Microbiol. 2014, 80, 3416–3425. [Google Scholar] [CrossRef] [PubMed]
Olivares, M.; Neef, A.; Castillejo, G.; De Palma, G.; Varea, V.; Capilla, A.; Palau, F.; Nova, E.; Marcos, A.; Polanco, I.; et al. The HLA-DQ2 genotype selects for early intestinal microbiota composition in infants at high risk of developing coeliac disease. Gut 2015, 64, 406–417. [Google Scholar] [CrossRef] [PubMed]
Ou, G.; Hedberg, M.; Hörstedt, P.; Baranov, V.; Forsberg, G.; Drobni, M.; Sandström, O.; Wai, S.N.; Johansson, I.; Hammarström, M.-L.; et al. Proximal Small Intestinal Microbiota and Identification of Rod-Shaped Bacteria Associated with Childhood Celiac Disease. Am. J. Gastroenterol. 2009, 104, 3058–3067. [Google Scholar] [CrossRef] [PubMed]
Segal, S.; Hill, A.V. Genetic susceptibility to infectious disease. Trends Microbiol. 2003, 11, 445–448. [Google Scholar] [CrossRef]
Zhou, B.; Yuan, Y.; Zhang, S.; Guo, C.; Li, X.; Li, G.; Xiong, W.; Zeng, Z. Intestinal Flora and Disease Mutually Shape the Regional Immune System in the Intestinal Tract. Front. Immunol. 2020, 11, 575. [Google Scholar] [CrossRef] [PubMed]
Stepniak, D.; Koning, F. Celiac Disease—Sandwiched between Innate and Adaptive Immunity. Hum. Immunol. 2006, 67, 460–468. [Google Scholar] [CrossRef]
Kusters, J.G.; van Vliet, A.H.M.; Kuipers, E.J. Pathogenesis of Helicobacter pylori Infection. Clin. Microbiol. Rev. 2006, 19, 449–490. [Google Scholar] [CrossRef] [PubMed]
Ishida, T.; Kinoshita, K. PrDOS: Prediction of disordered protein regions from amino acid sequence. Nucleic Acids Res. 2007, 35, W460–W464. [Google Scholar] [CrossRef] [PubMed]
Kelley, L.A.; Mezulis, S.; Yates, C.M.; Wass, M.N.; Sternberg, M.J.E. The Phyre2 web portal for protein modeling, prediction and analysis. Nat. Protoc. 2015, 10, 845–858. [Google Scholar] [CrossRef]
Case, D.A. AMBER 2015; University of California: San Francisco, CA, USA, 2015. [Google Scholar]
Knapp, B.; Lederer, N.; Omasits, U.; Schreiner, W. vmdICE: A plug-in for rapid evaluation of molecular dynamics simulations using VMD. J. Comput. Chem. 2010, 31, 2868–2873. [Google Scholar] [CrossRef]
Humphrey, W.; Dalke, A.; Schulten, K. VMD: Visual molecular dynamics. J. Mol. Graph. 1996, 14, 33–38. [Google Scholar] [CrossRef]
Zambrano, R.; Jamroz, M.; Szczasiuk, A.; Pujols, J.; Kmiecik, S.; Ventura, S. AGGRESCAN3D (A3D): Server for prediction of aggregation properties of protein structures. Nucleic Acids Res. 2015, 43, W306–W313. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Cartoon summarizing the results and hypothesis presented here. Gluten peptides and pathogens can both enter the intestinal lumen through oral intake. Based on this work, gluten peptides share high sequence, structural and morphological similarities with different proteins found in bacterial pathogens. Some of them share putative T-cell epitopes related to celiac disease (CeD), reinforcing the idea of molecular mimicry as the trigger of CeD (conserved amino acids in the 33-mer sequence and the bacterial proteins are shown (*)). On the other side, a new concept of morphology mimicry is presented, where the immune cells recognize gluten superstructures as bacterial pathogens because of their similar morphology, starting an innate immune response. Exposure to most pathogens and gluten is necessary but not sufficient to cause disease because the genetic background and susceptibility, and dose effects, are additional determinants for disease progression (Created with biorender.com, accessed on 23.08.21, under academic license terms).

Figure 2. Structural characterization of the reduced and oxidized states of α-2-gliadin of Triticum aestivum. Primary sequence of wheat α-2-gliadin where the p31-43 (blue) and 33-mer (red) regions are highlighted (A). Tridimensional structure of α-2-gliadin in the obtained previously [19] and the new disulfide state model (B). The immunogenic regions are in surface representation using the same color reference as in panels A and C, and the cysteine pairs involved in disulfide bonds are marked for clarity. Sequence disorder prediction was calculated using the PrDOS server (C), and the aggregation propensity was calculated using AGGRESCAN 3D (D) for both oxidation states. The horizontal gray line in panel C marks the 5% threshold (false positive rate), while aggregation-prone regions have positive values, respectively. The per residue correlation between both oxidation states is shown in panel (E).

Figure 3. Sequence and structural characterization of target proteins found by BLASTp search of the 33-mer peptide. For both target proteins from Streptomyces viridochromogenes (A) and Fischerella sp (B) the region sharing high sequence similarity with the 33-mer sequence is shown in a red box. The disorder prediction (top panel) of the best hits was calculated by the PrDOS server setting a 5% threshold (false positive rate, gray horizontal line). The percentage of solvent-accessible surface area (SASA) was calculated after an energy minimization of the tridimensional models (middle panel). The structure-based aggregation propensity (lower panel) was calculated using AGGRESCAN 3D, where positive values indicate an aggregation-prone region (horizontal line). The tridimensional structure of the target proteins was modeled using the PHYRE2 server. The regions sharing high sequence similarity with the 33-mer sequence are shown in surface representation. In addition, the sequence alignments between the 33-mer sequence and the high similarity region of the target proteins are shown at the bottom. Partially overlapping copies of T-cell epitope-like sequences are blue underlined in the alignment of panel (A).

Figure 4. Sequence and structural characterization of target proteins found by BLASTp search of the 33-mer peptide. For both target proteins from Nostoc sp. (A) and Streptococcus pneumoniae R6 (B) the region sharing high sequence similarity with the 33-mer sequence is shown in a red box. The disorder prediction (top panel) of the best hits was calculated by the PrDOS server setting a 5% threshold (false positive rate, gray horizontal line). The percentage of solvent-accessible surface area (SASA) was calculated after an energy minimization of the tridimensional models (middle panel). The structure-based aggregation propensity (lower panel) was calculated using AGGRESCAN 3D, where positive values indicate an aggregation-prone region (horizontal line). The tridimensional structure of the target proteins was modeled using the PHYRE2 server. The regions sharing high sequence similarity with the 33-mer sequence are shown in surface representation. In addition, the sequence alignments between the 33-mer sequence and the high similarity region of the target proteins are shown at the bottom. Relevant amino acid exchanges, such as Gln to Glu, are marked in red in the alignment of panel (B).

Figure 5. Sequence and structural characterization of target proteins found by BLASTp search of the p31-43 peptide. In all cases, the regions sharing high sequence similarity with the p31-43 sequence are shown in a blue box. The sequence-based disorder prediction for the MinD/ParA proteins from Streptomyces alboviridis and Streptomyces fulvissimus, the MinD-like ATPase from Streptomyces sp. ScaeMP-e48, and the hypothetical calcium-binding protein from Lentinula edodes, are shown in panels (A–D), respectively, setting a 5% threshold (false positive rate, gray horizontal line). For the latter, the percentage of solvent-accessible surface area (SASA) per residue and aggregation propensity are shown alongside the target model at the bottom right. The atrophin-1 domain is represented in surface (gray), and the region with high sequence similarity to the p31-43 sequence of α-2-gliadin is represented in van der Waals. In addition, the sequence alignment with the target protein is shown.

Figure 6. Predicted subcellular localization for the BLASTp hits proteins using the 33-mer sequence as query. The localization of the high similarity sequence regions from the five best BLASTp hits is shown in red. The SH3-binding motifs are highlighted with a rectangle. The analysis was performed using the Protter server [82]. The partially overlapping copies of the PNPQSPXP sequence in the RND efflux pump protein are shown as a red rectangle. The signal peptides are shown in blue and N-glycosylation motifs are marked as green squares.

Figure 7. Predicted subcellular localization for the BLASTp hits proteins using the p31-43 sequence as a query. The localization of the high similarity sequence regions from the four best BLASTp hits is shown in red. The Y/FPPQ-binding motif recognized by the WW domain is highlighted with a rectangle. The analysis was performed using the Protter server [82]. The N-glycosylation motifs are marked as green squares.

Figure 8. Morphology of host organisms associated with the 33-mer and p31-43 peptide BLASTp hits. Topology of the 33-mer ((A), adapted with permission from Reference [18]) and p31-43 oligomers ((B), adapted with permission from Reference [25]) alongside with the morphology of representative host organisms found in our work: Streptomyces sp. VN1 ((C), adapted from Reference [90]), Nostoc sp. EA03 ((D), adapted from Reference [91]), Staphylococcus pneumoniae TIGR4 ((E), adapted from Reference [92]) and Granulicatella adiacens ((F), adapted from Reference [93]). Images in panels (C–F) are under CC BY licenses.

Table 1. Sequence similarity of target protein regions found for the 33-mer (A) and p31-43 (B) peptides of α-2-gliadin.

Protein Name * (Reference)	Organism	Sequence Identity ** (E-Value)	Sequence Similarity (Subcellular and Topological Localization ***)	Organism Pathogenicity in Humans
BLASTp search using the 33mer sequence as query (A)
Hit#1: PQQ-repeat protein (NCBI: WP_048584697.1)	Streptomyces viridochromogenes	68% (0.004)	⁹⁵PYGQQQDPYGQPQAPYGQPQAPYGQPQP¹²² (T(s-p)/I)	Not reported; producer of bioactive compounds, e.g., antibiotics [31].
Hit#2: Aspartate kinase (NCBI: WP_017312730.1)	Fischerella sp.	54% (0.064)	⁴⁵⁹PIPNPQSPIPNPQSPIPDPRSPIPDP⁴⁸⁴ (I)	Not reported; producer of secondary metabolites [32].
Hit#3: F/YSIRK-type signal peptide-containing protein (NCBI: WP_070445895.1)	Granulicatella sp. HMSC31F03	46% (0.075)	¹⁷⁴⁷QPNPDPEKPTPDPEKPTPDPEKPTPDPE¹⁷⁷⁴ (T(s-p)/E)	Potentially pathogenic [33]; commensal of mucosal surfaces.
Hit#4: Choline-binding protein A (NCBI: WP_000458116.1)	Streptococcus pneumoniae R6	45% (0.079)	⁴¹²KPAEQPQPAPATQPEKPAPKPEKPAEQPK⁴⁴⁰ (E)	Opportunistic pathogen [34]; mucosal surface in upper respiratory tract-forming biofilms.
Hit#5: Efflux RND transporter permease subunit (NCBI: WP_015136936.1)	Nostoc sp.	54% (0.097)	⁵³⁵LPNPQSPIPNPQSPVPNPQSPIPINPL⁵⁶² (T(m-p)/E)	Potentially low pathogenic via ecotoxicology [35]; producer of toxic compounds and bioactive compounds with pharmaceutical potential.
BLASTp search using the p31-43 sequence as query (B)
Hit#1: MinD/ParA family protein (NCBI: WP_032757377.1)	Streptomyces alboviridis	85% (0.003)	⁹¹⁸YPGQQQPYPPQQP⁹³⁰ (I)	Not reported.
Hit#2: MinD/ParA family protein (NCBI: WP_015611830.1)	Streptomyces fulvissimus	85% (0.003)	⁹¹¹YPGQQQPYPPQQP⁹²³ (I)	Not reported.
Hit#3: MinD-like ATPase (GenBank: SCK04981.1)	Streptomyces sp. ScaeMP-e48	85% (0.003)	⁹⁰⁴YPGQQQPYPPQQP⁹¹⁶ (I)	Not reported.
Hit#4: Hypothetical calcium-binding protein LENED-006428 (GenBank: GAW04623.1)	Lentinula edodes	75% (0.024)	⁹⁰⁹FPSQQQPAPYFPPQP⁹²⁴ (I)	Potentially pathogenic [36] causing, e.g., dermatitis herpetiformis; mushroom used in traditional medicine.

* Hits ordered by increasing E-value. ** Sequence identity against 33-mer and p31-43 peptides of α-2-gliadin. *** The sequence and location of the region sharing high sequence similarity with the gliadin peptides within the target proteins are given. Subcellular localization prediction T (s-p or m-p): transmembrane single-pass or multi-pass, E: extracellular, and I: intracellular.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Vazquez, D.S.; Schilbert, H.M.; Dodero, V.I. Molecular and Structural Parallels between Gluten Pathogenic Peptides and Bacterial-Derived Proteins by Bioinformatics Analysis. Int. J. Mol. Sci. 2021, 22, 9278. https://doi.org/10.3390/ijms22179278

AMA Style

Vazquez DS, Schilbert HM, Dodero VI. Molecular and Structural Parallels between Gluten Pathogenic Peptides and Bacterial-Derived Proteins by Bioinformatics Analysis. International Journal of Molecular Sciences. 2021; 22(17):9278. https://doi.org/10.3390/ijms22179278

Chicago/Turabian Style

Vazquez, Diego S., Hanna M. Schilbert, and Veronica I. Dodero. 2021. "Molecular and Structural Parallels between Gluten Pathogenic Peptides and Bacterial-Derived Proteins by Bioinformatics Analysis" International Journal of Molecular Sciences 22, no. 17: 9278. https://doi.org/10.3390/ijms22179278

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Molecular and Structural Parallels between Gluten Pathogenic Peptides and Bacterial-Derived Proteins by Bioinformatics Analysis

Abstract

1. Introduction

2. Results and Discussion

2.1. High Sequence and Structural Similarity of 33-mer and p31-43 Sequences in Pathogen-Derived Proteins

2.2. The Structural Evaluation of Gliadin Peptides Harboring Pathogen-Related Proteins Similarity Regions and Their Function

2.2.1. Spatial Localization and Structural Information of the Relevant Gliadin-Derived Peptides in the α-2-Gliadin 3D Model

2.2.2. The Homology Models of Pathogen-Related Proteins Containing Sequences Similar to 33-mer Gliadin Peptide and Their Function

2.2.3. The Homology Models of Pathogen-Related Proteins Containing the p31-43 Similar Sequence and Their Function

2.3. Primary Structure Analysis and Potential Functions

2.3.1. CeD-T-Cell Epitopes

2.3.2. SH3/WW Domains Binders

2.4. Gliadin Peptide Superstructures, Pathogen Morphology, and Dysbiosis as Triggers of the Innate Immune Response: The Hypothesis

3. Materials and Methods

4. Conclusions

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI